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Use of a DG153 secreted protein product for preventing and treating 
pancreatic diseases and/or obesity and/or metabolic syndrome 

Description 

Tliis invention relates to the use of low molecular weight DG1 53 proteins, 
to the use of polynucleotides encoding these, and to the use of effectors 
/ modulator thereof in the diagnosis, study, prevention, and treatment of 
pancreatic diseases (e.g. diabetes mellitus), obesity and/or metabolic 
syndrome and to the use in regeneration of tissues such as pancreatic 
tissues and others. 

Many human proteins serve as pharmaceutlcally active compounds. 
Several classes of human proteins that serve as such active compounds 
include hormones, cytokines, cell growth factors, and cell differentiation 
factors. Most proteins that can be used as a pharmaceutlcally active 
compound fall within the family of secreted proteins. Secreted proteins are 
generally produced within cells at rough endoplasmic reticulum, are then 
exported to the golgi complex, and then move to secretory vesicles or 
granules, where they are secreted to the exterior of the cell via exocytosis. 
Examples for commercially used secreted proteins are human insulin, 
thrombolytic agents, interferons, interleukins. colony stimulating factors, 
human growth hormone, transforming growth factor beta, tissue 
plasminogen activator, erythropoeitin, and various other proteins. 
Receptors of secreted proteins, which are membrane-bound proteins, also 
have potential as therapeutic or diagnostic agents. It is, therefore, 
important for developing new pharmaceutical compounds to identify 
secreted proteins that can be tested for activity in a variety of animal 
models. Thus, in light of the pen^asive role of secreted proteins in human 
physiology, a need exists for identifying and characterizing novel functions 
for human secreted proteins and the genes that encode-them. This 
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knowledge will allow one to detect, to treat, and to prevent medical 
diseases, disorders, and/or conditions by using secreted proteins or the 
genes that encode them. 

The pancreas is an essential organ possessing both an exocrine function 
involved in the delivery of en^ymes into the digestive tract and an 
endocrine function by which various hormones are secreted into the blood 
stream. The exocrine function is assured by acinar and centroacinar cells 
that produce various digestive enzymes and intercalated ducts that 
transport these enzymes in allcaline solution to the duodenum. The 
functional unit of the endocrine pancreas is the islet of Langerhans. Islets 
are scattered throughout the exocrine portion of the pancreas and are 
composed of four cell types: alpha-, beta-, delta- and PP-cells, reviewed for 
example in Kim & Hebrok, 2001, Genes & Development 15:111-127. 
Beta-cells produce insulin, represent the majority of the endocrine cells 
and form the core of the islets, while alpha-cells secrete glucagon and are 
located in the periphery. Delta-cells and PP-cells are less numerous and 
secrete somatostatin and pancreatic polypeptide, respectively. 

Early pancreatic development has been well studied in different species, 
including chicken, zebrafisii, and mice (for an detailed review, see Kim & 
Hebrock, 2001, supra). The pancreas develops from distinct dorsal and 
ventral aniagen. Pancreas development requires specification of the 
pancreas aniage along both anterior-posterior and dorsal-ventral axes. A 
number of transcription factors which are critical for proper pancreatic 
development have been identified (see Kim & Hebrok, 2001, Genes & 
Development 15:111-127; T: Wilson, Scheel & German Mech Dev. 
120:65-80). 

Later in life, the acinar and ductal cells retain a significant proliferative 
capacity that can ensure cell renewal and growth, whereas the islet cells 
become mostly mitotically Inactive, it has been suggested, that during 
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embryonic development, and probably later in life, pancreatic islets of 
Langerhans originate from differentiating epithelial stem cells. These stem 
cells are situated in the pancreatic ductal epithelium or close to the 
pancreatic ducts but are otherwise poorly characterized (Bonner-Weir & 
Sharma A, J Pathol- 197:519-26). However, also an intra-islet location or 
an origin in the bone marrow has been suggested for precursor cells of 
adult beta cells (Zulewski et al.. Diabetes 50:521-33; lanus et al., J Clin 
Invest. 1 1 1 :843-50). The early progenitor cells to the pancreatic islets are 
multipotential and coactivate an early endocrine gene expression program. 
As development proceeds^ expression of islet-specific hormones becomes 
restricted to the pattern of expression characteristic for mature islet cells. 
Pancreatic islet growth is dynamic and responds to changes in insulin 
demand, such as during pregnancy or during the increase in body mass 
occuring during childhood. 

Many pancreas diseases are associated with defects in pancreatic 
architecture, but the molecular mechanisms underlying these defects are 
basically unknown. However, studies have shown that signaling pathways 
Influence pancreatic cell fates as well as the morphogenesis of pancreatic 
structures, for example FGF signaling, activin signaling, the Hedgehog 
pathway, notch signaling, vascular epithelial growth factor (VEGF) 
signaling, and transforming growth factor (TGF)-beta signaling pathway. 
However, much remains to be learned about the precise roles of these 
pathways. In addition, many other extracellular signals and pathways 
controlling the development of the pancreas remain to be identified. 

Pancreatic beta-cells secrete insulin which is stimulated by high blood 
glucose levels. Insulin amongst other hormones plays a key role in the 
regulation of the fuel metabolism. Insulin leads to the storage of glycogen 
and triglycerides and to the synthesis of proteins. The entry of glucose into 
muscles and adipose cells is stimulated by insulin. In patients who suffer 
from diab-eces mslHtus either ihs amount of insulin produced by the 



pancreatic islet cells is too low (diabetes type I or insulin dependent 
diabetes mellitus, IDDM) or liver and muscle cells loose their ability to 
respond to normal blood insulin levels (insulin resistance). In the next stage 
pancreatic cells become unable to produce sufficient amounts of Insulin 
(diabetes type II or non insulin dependent diabetes mellitus, NIDDM). 
Diabetes is a very disabling disease, because medications do not control 
blood sugar levels well enough to prevent swinging between high and low 
blood sugar levels. Patients with diabetes are at risk for major 
complications, including diabetic ketoacidosis, end-stage renal disease, 
diabetic retinopathy and amputation. There are also a host of related 
conditions, such as metabolic syndrome, obesity, hypertension, heart 
disease, peripheral vascular disease, and infections, for which persons 
with diabetes are at substantially increased risk. 

Obesity is one of the most prevalent metabolic disorders in the world. It is 
still a poorly understood human disease that becomes as a major health 
problem more and more relevant for western society. Obesity is defined as 
a body weight more than 20% in excess of the Ideal body weight, 
frequently resulting in a significant impairment of health. Obesity may be 
measured by body mass index, an indicator of adiposity or fatness. Further 
parameters for defining obesity are waist circumferences, skinfold 
thickness and bioimpedance. It is associated with an increased risk for 
cardiovascular disease, hypertension, diabetes mellitus type II, 
hyperlipidaemia and an increased mortality rate. Obesity is influenced by 
genetic, metabolic, biochemical, psychological, and behavioral factors and 
• can be caused by different reasons such as non-insulin dependent 
diabetes, increase in triglycerides, increase in carbohydrate bound energy 
and low energy expenditure (Kopelman P.G., (2000) Nature 404, 
635-643). 

The concept of 'metabolic syndrome' (syndrome x, insulin-resistance 
syndrome, deadly quartet) was first described 1966 by Camus and 



reintroduced 1988 by Reaven (Camus JP, 1966, Rev Rhum Mai Osteoartic 
33{1):10-14; Reaven et al. 1988, Diabetes, 37(1 2):1 595-1 607). Today, 
metabolic syndrome is commonly defined as clustering of cardiovascular 
risk factors like hypertension, abdominal obesity, high blood levels of 
triglycerides and fasting glucose as well as low blood levels of HDL 
cholesterol. Insulin resistance greatly increases the risk of developing the 
metabolic syndrome (Reaven, 2002, Circulation 106(3): 286-288). The 
metabolic syndrome often precedes the development of type II diabetes 
and cardiovascular disease (McCook, 2002, JAMA 288:2709-2716). The 
control of blood lipid levels and blood glucose levels is essential for the 
treatment of the metabolic syndrome (see, for example, Santomauro A. T. 
etal., (1999) Diabetes, 48(9):1 836-1 841). 

The molecular factors regulating food intake and body weight balance are 
incompletely understood. Even if several candidate genes have been 
described which are supposed to influence the homeostatic system(s) that 
regulate body mass/weight, like leptin or the peroxisome 
proliferator-activated receptor-gamma co-activator, the distinct molecular 
mechanisms and/or molecules influencing obesity or body weight/body 
mass regulations are not known. 

There is a need in the prior art for the identification of candidate genes that 
are specifically expressed in early development in certain pancreatic 
tissues. These gends and the thereby encoded proteins can provide tools 
to the diagnosis and treatment of severe pancreatic disorders and related 
diseasfes. Therefore, this Invention describes a secreted proteins that are 
specifically expressed in pancreatic tissues early in the development. The 
invention relates to the use of these genes and proteins in the diagnosis, 
prevention and/or treatment of pancreatic dysfunctions, such as diabetes, 
and other related diseases such as obesity and/or metabolic syndrome. 
These proteins and genes are especially useful in regeneration processes, 
such as.reqeneration of the pancrasc cells* 
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In this Invention, we disclose a secreted factor referred to as DG1 53 which 
is involved in pancreas development, regeneration, and In the regulation of 
energy homeostasis. The mRNA encoding this low molecular weight 
protein was described to be expressed In human liver (see Kawamoto et 
al.. Gene 1996 174(1):151-8). The highly conserved arginine-rich protein 
was shown to be mutated in different cancers, such as renal cell 
carcinomas, pancreatic, lung, breast, and prostate cancers, and in 
squamous cell carcinoma of the head and neck (Shridhar et al. Oncogene 
12 (9), 1931-1939 (1996), Shridhar et al. Cancer Res. 56 (24), 
5576-5578 (1996), Shridhar et al. Oncogene 14(18), 2213-2216 (1997)). 
The mutations associated with these cancers appear to be centered in an 
imperfect trinucleotide repeat that putatively encodes a stretch of 1 5 to 1 8 
arginines. The association of these mutations with cancer suggests that 
the putative protein may be a growth factor that plays an important role In 
the regulation of cell growth and development. No further functional data 
are available in the scientific prior art for this protein. 

However, the protein and possible further uses in different disease 
Indications are disclosed in several patent applications. For example, 
DG153 is described as human shear stress response protein useful in the 
diagnosis, treatment and screening of vascular diseases caused by 
arteriosclerosis. Including heart failure, post-PTCA restenosis and 
hypertension (see WO 01/25427)). Patent application W002/74956 
describes the a 98% identical protein as useful for treating 
neurodegenerative diseases (such as Parkinson's disease or Alzheimer's 
diseases). A 97% identical protein is described in WO 02/90541 useful for 
diagnosing behavioral disorder, or assessing the likelihood of developing 
behavioral disorder (like Attention Deficit Hyperactivity Disorder or 
intellectual disorders). The precursor protein (179 amino acids) is 
described in patent application WO2001 1 9851 for treating nervous system 
diseases or disorders (such as Parkinson's disease), and for transplanting 
ceils into nervous system. A 180 amino acid fragment of DG153 is 
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described in WO 01/70174 as being useful for modulating angiogenesis 
and/or apoptosis for preventing or treating cancer, myocardial infarction 
and promoting healing, by modulating the activity of vascular endothelial 
growth factor-modulated gene polypeptide. 

5 

To the best of our knowledge, no disclosure has been made in the prior art 
that describes a function of DG153 in pancreatic disorders including 
diabetes, in obesity or metabolic syndrome. In addition, a role of the 
secreted factor DG153 in the regeneration of tissues such as pancreatic 

10 tissues and others is novel and surprising. Thus, since secreted proteins 
' are a major target for drug action and development, it is of high value to 
identify and characterize novel functions for secreted proteins. The present 
invention advances the state of the art by providing previously unknown 
functions for a low molecular weight human secreted protein, DG 1 53. This 

15 satisfies a need in the art by providing new therapeutic strategies for 
treating pancreatic diseases (e.g. diabetes), obesity, and/or metabolic 
syndrome, for example, treating diabetes by regeneration of beta cells. The 
DG153 can be used to promote a regeneration of, for exmpte, pancreas 
cells which then start producing insulin on their own. Therefore, further 

20 discovery and development of such novel functions for secreted proteins 
like DG153 would have a strong beneficial affect on medical services and 
healthcare. 

Accordingly, the present invention relates to a secreted protein with novel 
25 functions in the human metabolism, regeneration and developmental 
processes. The present invention discloses specific genes and proteins 
encoded thereby and effectors/modulators thereof involved in the 
regulation of pancreatic function and metabolism, especially in pancreas 
diseases such as diabetes mellitus, e.g. insulin dependent diabetes mellitus 
30 and/or non-insulin dependent diabetes mellitus, and/or metabolic syndrome, 
obesity, and/or related disorders such as coronary heart disease, eating 
disorder, C3che::ia. hypertension, hypercholesterolemia (dyslipidemia), liver 



fibrosis, and/or gallstones. Further, the present invention discloses specific 
genes and proteins encoded thereby and effectors/modulators thereof 
involved in the modulation, e.g. stimulation, of pancreatic development 
and/or regeneration of pancreatic cells or tissues, e.g. cells having 
exocrinous functions such as acinar cells, centroacinar cells and/or ductal 
cells, and/or cells having endocrinous functions, particularly cells in 
Langerhans islets such as alpha-, beta-, delta- and/or PP-cells, more 
particularly beta-cells. 

In this invention, we used a screen, for secreted factors expressed in 
developing mammalian (mouse) pancreas, as described in more detail in the 
Examples section (see Example 1). This screen identified DG153 as 
secreted factor expressed in developing mouse pancreas. The present 
invention describes mammalian DG153 proteins and the polynucleotides 
encoding these, in particular human DG153, as being involved in the 
conditions and processes mentioned above. 

The present invention relates to DG153 polynucleotides encoding 
polypeptides with novel functions in the development and regeneration of 
pancreatic tissues and thus in mammalian pancreatic diseases (e.g. 
diabetes), and also in body-weight regulation, energy homeostasis, and 
obesity, fragments of said polynucleotides, polypeptides encoded by said 
polynucleotides or fragments thereof. The invention also relates to vectors, 
host cells, and recombinant methods for producing the polypeptides and 
polynucleotides of the invention. The invention also relates to 
effectors/modulators of DG153 polynucleotides and/or polypeptides, e.g. 
antibodies, biologically active nucleic acids, such as antisense molecules, 
RNAi molecules or ribozymes, aptamers, peptides or low-molecular weight 
organic compounds recognizing said polynucleotides or polypeptides. 

DG153 homologous proteins and nucleic acid molecules coding therefore 
are obtainable from vertebrate species. Particularly preferred are nucleic 



acids encoding the human DG153 protein and variants thereof. The 
invention particulariy relates to a nucleic acid molecule encoding a 
polypeptide contributing to regulating the energy homeostasis and the 
mammalian metabolism, wherein said nucleic acid molecule comprises 

(a) the nucleotide sequence of human DG153 (SEQ ID NO: 1) and/or a 
sequence complementary thereto, 

(b) a nucleotide sequence which hybridizes at 50°C in a solution 
containing 1 x SSC and 0.1% SDS to a sequence of (a), 

(c) a sequence corresponding to the sequences of (a) or (b) within the 
degeneration of the genetic code, 

(d) a sequence which encodes a polypeptide which is at least 85%, 
preferably at least 90%, more preferably at least 95%, more 
preferably at least 98% and up to 99,6% identical to the amino acid 
sequences of the human DG153 protein (SEQ ID NO: 2 or 3), 

(e) a sequence which differs from the nucleic acid molecule of (a) to (d) 
by mutation and wherein said mutation causes an alteration, 
deletion, duplication and/or premature stop in the encoded 
polypeptide or 

(f) a partial sequence of any of the nucleotide sequences of (a) to (e) 
having a length of 15-25 bases, preferably 25-35 bases, more 
preferably 35-50 bases and most preferably at least 50 bases. 

The function of the mammalian DG153 in metabolism was validated by 
analyzing the expression of the transcripts in different tissues and by 
analyzing the role in adipocyte differentiation. 

Expression profiling studies (see Examples for more detail) confirm the 
particular relevance of DG153 as regulator of energy metabolism in 
mammals. Taqman analysis revealed that DG153 is expressed in several 
mammalian tissues, with highest expression levels in testis. In addition, 
DG153 is highly expressed in metabolic active tissue such as white 
adipose tissue (WAT) comparsd to other tissua types in vi/ild type mouse 
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as depicted in FIGURE 2A. BAT is a well characterized tissue which is well 
developed in newborn mamnnals, including humans. One important task of 
BAT is to generate heat and maintain body temperature homeostasis in 
newborn. Thus; an expression of DG153 protein in adipose tissues is 
confirming a role in the regulation of metabolism, particularly energy 
homeostasis and thermogenesis. 

We used mouse models of insulin resistance and/or diabetes, such as mice 
carrying gene knockouts in the leptin pathway (for example, ob/ob (leptin) 
or db (leptin receptor/ligand) mice) to study the expression of DG1 53. Such 
mice develop typical symptoms of diabetes, show hepatic lipid 
accumulation and frequently have increased plasma lipid levels (see 
Bruning et al. 1998, Mol. Cell. 2:449-569). We found, for example, that 
the expression of DG153 is strongly downregulated in metabolic active 
tissue (WAT) in genetically induced obese mice (ob/ob) compared to fasted 
mice (see FIGURE 28). These data further support an essential role of 
DG153 in the regulation of the mammalian metabolism, particularly in 
processes related to , obesity, or metabolic syndrome. 

Expression of DG153 mRNA was also examined in susceptible wild type 
mice (for example, C57BI/6) that show symptoms of diabetes, lipid 
accumulation, and high plasma lipid levels, if fed a high fat diet. In those 
mice, the expression of DG153 is significantly downregulated in WAT 
supporting the finding in ob/ob mice. In addition, the expression of DG1 53 
Is upregulated in BAT and muscle tissues. These results confirm a role of 
DG1 53 is involved in the regulation of mammalian metabolism (see FIGURE 
2C). 

Further, we show (see Examples and FIGURES 2 and 3) that the DG153 
protein has to be downregulated in order for the preadipocyctes to 
differentiate into mature adipocycte. With regard to changes in expression 
intensity during the differentiation of preadipocytes to adipocytes, a slight 
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reduction in relative signal intensity can be observed for DG153 during the 
in vitro differentiation program of -STS-LI (see Figure 2D). Therefore, the 
DG1 53 protein might play an essential role in adipogenesis. The results are 
suggesting a role as modulator of adipogenesis. 

Microarrays are analytical tools routinely used in bioanalysis. A microarray 
has molecules distributed over, and stably associated with, the surface of 
a solid support. The term "microarray" refers to an arrangement of a 
plurality of polynucleotides, polypeptides, antibodies, or other chemical 
compounds on a substrate. Microarrays of polypeptides, polynucleotides, 
and/or antibodies have been developed and find use in a variety of 
applications, such as monitoring gene expression, drug discovery, gene 
sequencing, gene mapping, bacterial identification, and combinatorial 
chemistry. One area in particular in which microarrays find use is in gene 
expression analysis (see Example 4). Array technology can be used to 
explore the expression of a single polymorphic gene or the expression 
profile of a large number of related or unrelated genes. When the 
expression of a single gene is examined, arrays are employed to detect the 
expression of a specific gene or its variants. When an expression profile is 
examined, arrays provide a platform for identifying genes that are tissue 
specific, are affected by a substance being tested in a toxicology assay, 
are part of a signaling cascade, carry out housekeeping functions, or are 
specifically related to a particular genetic predisposition, condition, disease, 
or disorder. 

Microarrays may be prepared, used, and analyzed using methods known in 
the art (see for example, Brennan, T.M. et al. (1995) U.S. Patent No- 
5,474,796- Schena, M. et al. (1996) Proc. Natl. Acad. Sci. USA 
93:10614-10619; Baldeschweiler et al. (1995) PCT application 
W095/25in6; Shalon, D, et al. (1995) PCT application WO95/35505; 
Heller, R.A. etal. (1997) Proc. Natl. Acad. Sci. USA 94:21502155; Heller, 
rvi.J- at al. (1997) U.S. Patent Wo. 5,605/332). Various c\^pes of 
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microarrays are well known and thoroughly described in Schena, M., ed. 
(1999; DNA Microarrays: A Practical Approach, Oxford University Press, 
London). 

Oligonucleotides or longer fragments derived from any of the 
polynucleotides described herein may be used as elements on a microarray. 
The microarray can be used in transcript imaging techniques which monitor 
the relative expression levels of large numbers of genes simultaneously as 
described below. The microarray may also be used to identify genetic 
variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to 
diagnose a disorder, to monitor progression/regression of disease as a 
function of gene expression, and to develop and monitor the activities of 
therapeutic agents in the treatment of disease. In particular, this 
information may be used to develop a pharmacogenomic profile of a 
patient in order to select the most appropriate and effective treatment 
regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient 
based on his/her pharmacogenomic profile. 

As determined by Microarray analysis, DG153 shows differential 
expression in human primary adipocytes. A downregulation is observed 
concerning the expression of DG153 is during the human adipocyte 
differentiation (see FIGURE 2F). The DG153 protein in preadipocyctes has 
the potential to inhibit adipose differentiation at a very early stage. 
Therefore, the DG1 53 protein might play an essential role in adipogenesis. 
The results are suggesting a role of DG153 in the regulation in human 
metabolism, for example, as effector / modulator (for example, inhibitor) of 
adipogenesis. Thus, DG1 53 is a strong candidate for the manufacture of a 
pharmaceutical composition and a medicament for the treatment of 
conditions related to human metabolism, such as diabetes, obesity, and/or 
metabolic syndrome. 
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The invention also encompasses polynucleotides that encode the proteins 
of the invention and homologous proteins. Accordingly, any nucleic acid 
sequence, which encodes the amino acid sequences of the proteins of the 
invention and homologous proteins, can be used to generate recombinant 
molecules that express the proteins of the invention and homologous 
proteins. In a particular embodiment, the invention encompasses a nucleic 
acid encoding DG1 53. It will be appreciated by those skilled in the art that 
as a result of the degeneracy of the genetic code, a multitude of nucleotide 
sequences encoding the proteins, some bearing minimal homology to the 
nucleotide sequences of any known and naturally occurring gene, may be 
produced. The invention contemplates each and every possible variation of 
nucleotide sequence that can be made by selecting combinations based on 
possible codon choices. 

Also encompassed by the invention are polynucleotide sequences that are 
capable of hybridizing to the claimed nucleotide sequences, and in 
particular, those of the polynucleotide encoding the proteins of the 
invention, under various conditions of stringency. Hybridization conditions 
are based on the melting temperature (Tm) of the nucleic acid binding 
complex or probe, as taught in Wahl & Berger (1987: Methods Enzymol. 
152:399-407) and Kimmel (1987; Methods Enzymol. 152:507-511), and 
may be used at a defined stringency. Preferably, hybridization under 
stringent conditions means that after washing for 1 h with 1 x SSC and 
0.1 % SDS at 50°C, preferably at 55°C, more preferably at 62°C and most 
preferably at 65°C, particularly for 1 h in 0.2 x SSC and 0.1% SDS at 
50°C, pi'feferably at 55°C, more preferably at 62°C and most preferably at 
65 °C, a positive hybridization signal is observed. Altered nucleic acid 
sequences encoding the proteins which are encompassed by the invention 
include deletions, insertions or substitutions of different nucleotides 
resulting in a polynucleotide that encodes the same or a functionally 
equivalent protein. 
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The. encoded proteins may also contain deletions, insertions or 
substitutions of amino acid residues, which produce a silent change and 
result In functionally equivalent proteins. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, 

5 solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of 
the residues as long as the biological activity of the protein is retained. 
Furthermore, the invention relates to peptide fragments of the proteins or 
derivatives thereof such as cyclic peptides, retro-inverso peptides or 
peptide mimetics having a length of at least 4, preferably at least 6 and up 

10 to 50 amino acids. 

Also included within the scope of the present invention are alleles of the 
genes encoding the proteins of the invention and homologous proteins. As 
used herein, an 'allele' or 'allelic sequence' is an alternative form of the 

15 gene, which may result from at least one mutation In the nucleic acid 
sequence. Alleles may result in altered mRNAs or polypeptides whose 
structures or function may or may not be altered: Any given gene may 
have none, one or many allelic forms. Common mutational changes, which 
give rise to alleles, are generally ascribed to natural deletions, additions or 

20 substitutions of nucleotides. Each of these types of changes may occur 
alone or in combination with the others, one or more times in a given 
sequence. 

The nucleic acid sequences encoding DG153 and homologous proteins 
25 may be extended utilizing a partial nucleotide sequence and employing 
various methods known in the art to detect upstream sequences such as 
promoters and regulatory elements. 

In order to express a biologically active protein, the nucleotide sequences 
30 encoding the proteins or functional equivalents, may be inserted into 
appropriate expression vectors, i.e., a vector which contains the necessary 
elements for the transcription and translation of the inserted coding 
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sequence. Methods, which are well known to those skilled in the art, may 
be used to construct expression vectors containing sequences encoding 
the proteins and the appropriate transcriptional and translational control 
elements. Regulatory elements include for example a promoter, an initiation 
codon, a stop codon, a mRNA stability regulatory element, and a 
polyadenylation signal. Expression of a polynucleotide can be assured by (i) 
constitutive promoters such as the Cytomegalovirus (CMV) 
promoter/enhancer region, (ii) tissue specific promoters such as the insulin 
promoter (see, Soria et al., 2000, Diabetes 49:157), SOX2 gene promoter 
(see Li et al., (1998) Curr. Biol. 8:971-974), Msi-1 promoter (see 
Sakakibara et al., (1997) J. Neuroscience 17:8300-8312), alpha-cardia 
myosin heavy chain promoter or human atrial natriuretic factor promoter 
(Klug et aL, (1996) J. din. Invest 98:216-224; Wu et a!., (1989) J. Biol. 
Chem. 264:6472-6479) or (iii) inducible promoters such as the tetracycline 
inducible system. Expression vectors can also contain a selection agent or 
marker gene that confers antibiotic resistance such as the neomycin, 
hygromycin or puromycin resistance genes. These methods include In vitro 
recombinant DNA techniques, synthetic techniques, and in vivo genetic 
recombination. Such techniques are described in Sambrook, J. et al. 
(1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, 
Plainview, N.Y. and Ausubel, P.M. et al. (1989) Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, N.Y. 

In a further embodiment of the Invention, natural, modified or recombinant 
nucleic acid sequences encoding the proteins of the invention and 
homologous proteins may be ligated to a heterologous sequence to encode 
a fusion protein.- 

A variety of expression vector/host systems, as known in the art, may be 
utilized to contain and express sequences encoding the proteins or fusion 
proteins. These include, but are not limited to, micro-organisms such as 

bacteria trans^formBd with recombinant bacteriophage, plasmid or cosmid 
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DNA expression vectors; yeast transformed with yeast expression vectors; 
Insect cell systems infected with virus expression vectors (e.g.. 
baculovirus, adenovirus, adeno-assoclated virus, lentlverus, retrovirus); 
plant cell systems transformed with virus expression vectors (e.g., 
cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with 
bacterial expression vectors (e.g., Ti or PBR322 plasmids); or animal cell 
systems. 

The presence of polynucleotide sequences of the invention in a sample can 
be detected by DNA-DNA or DNA-RNA hybridization and/or amplification 
using probes or portions or fragments of said polynucleotides. Nucleic acid 
amplification based assays involve the use of oligonucleotides or oligomers 
based on the sequences specific for the gene to detect transformants 
containing DNA or RNA encoding the corresponding protein. As used 
herein 'oligonucleotides' or 'oligomers' refer to a nucleic acid sequence of 
at least about 10 nucleotides and as many as about 60 nucleotides, 
preferably about 1 5 to 30 nucleotides, and more preferably about 20-25 
nucleotides, which can be used as a probe or amplimer . 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid 
assays. Means for producing labeled hybridization or PGR probes for 
detecting polynucleotide sequences include oligo-labeling, nick translation, 
end-labeling of RNA probes, PGR amplification using a labeled nucleotide, 
or enzymatic synthesis. These procedures may be conducted using a 
variety of commercially available kits (Pharmacia & Upjohn, (Kalamazoo, 
Mich.); Promega (Madison Wis.); and U.S. Biochemical Gorp., (Gleveland, 
Ohio). 

The presence of DG153 in a sample can be determined by immunological 
methods or activity measurement. A variety of protocols for detecting and 
measuring the expression of proteins, using either polyclonal or monoclonal 
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antibodies specific for the protein or reagents for determining protein 
activity are known In the art. Examples include enzyme*linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay 
utilizing monoclonal antibodies reactive to two non-interfering epitopes on 
the protein is preferred, but a competitive binding assay may be employed. 
These and other assays are described, among other places, in Hampton, R. 
et aL (1990; Serological Methods, a Laboratory Manual, APS Press, St 
Paul, Minn.) and Maddox, D. E. et al. (1983; J. Exp. Med. 
158:1211-1216). 

Suitable reporter molecules or labels, which may be used, include 
radionuclides, enzymes, fluorescent, chemiluminescent or chromogenic 
agents as well as substrates, co-factors, inhibitors, magnetic particles, and 
the like. 

The nucleic acids encoding the proteins of the invention can be used to 
generate transgenic animal or site specific gene modifications in cell lines. 
Transgenic animals may be made through homologous recombination, 
where the normal locus of the genes encoding the proteins of the invention 
is altered. Alternatively, a nucleic acid construct is randomly integrated into 
the genome. Vectors for stable integration include plasmids, retrovirusses 
and other animal virusses, YACs, and the like. The modified cells or animal 
are useful in the study of the function and regulation of the proteins of the 
invention. For example, a series of small deletions and/or substitutions may 
be made in the genes that encode the proteins of the invention to 
determine the role of particular domains of the protein, functions in 
pancreatic differentiation, etc. 

Specific constructs of interest include anti-sense molecules, which will 
block the expression of the proteins of the invention, or expression of 
dominant negative mutations. A detectable marker, such as for e?cample 
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lac-Z, may be introduced in the locus of the genes of the invention, where 
upregulation of expression of the genes of the invention will result in an 
easily detected change in phenotype. 

One may also provide for expression of the genes of the invention or 
variants thereof in cells or tissues where it is not normally expressed or at 
abnormal times of development. In addition, by providing expression of the 
proteins of the invention in cells in which they are not normally produced, 
one can induce changes in cell behavior. 

DNA constructs for homologous recombination will comprise at least 
portions of the genes of the invention with the desired genetic 
modification, and will include regions of homology to the target locus. DNA 
constructs for random integration need not include regions of homology to 
mediate recombination. Conveniently, markers for positive and/or negative 
selection are included. Methods for generating cells having targeted gene 
modifications through homologous recombination are known in the art. For 
non-humanembryonic stem (ES) cells, an ES cell line may be employed, or 
embryonic cells may be obtained freshly from a host, e.g. mouse, rat, 
guinea pig etc. Such cells are grown on an appropriate fibroblast-feeder 
layer or grown in presence of leukemia inhibiting factor (LIF). 

The data disclosed in this invention show that the DG1 53 nucleic acids and 
proteins and effector/modulator molecules thereof are useful in diagnostic 
and therapeutic applications implicated, for example, but not limited to, 
pancreatic diseases (e.g. diabetes mellitus such as insulin dependent 
diabetes mellitus and/or non insulin dependent diabetes mellitus), obesity, 
metabolic syndrome, eating disorder, cachexia, hypertension, coronary 
heart disease, hypercholesterolemia (dyslipidemia), and/or gallstones. 
Further, the data show that the DG153 nucleic acids and proteins and 
effector/modulator molecules thereof are useful for the modulation, e.g. 
stimulation, of pancreatic development and/or for the regeneration of 
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pancreatic cells or tissues, e.g. cells having exocrinous functions such as 
acinar cells, centroacinar cells and/or ductal cells, and/or cells having 
endocrinous functions, particularly cells in Langerhans islets such as alpha- 
, beta-, delta- and/or PP-cells, more particularly beta-cells. Hence, 
diagnostic and therapeutic uses for the proteins of the invention nucleic 
acids and proteins of the invention are, for example but not limited to, the 
following: (i) tissue regeneration in vitro and in vivo (regeneration for all 
these tissues and cell types composing these tissues and cell types derived 
from these tissues), (ii) small molecule drug target, (iii) antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic 
and/or prognostic marker, (v) protein therapy, (vi) gene therapy (gene 
delivery/gene ablation), and / or (vii) research tools. 

For example, but not limited to, cDNAs encoding the proteins of the 
invention and particularly their human homologues may be useful in 
stimulating, enhancing or regulating the regeneration of tissues, and the 
proteins of the invention and particularly their human homologues may be 
useful when administered to a subject in need thereof. By way of 
non-limiting example, the compositions of the present invention will have 
efficacy for treatment of patients suffering from, for example, pancreatic 
diseases (e.g. diabetes), obesity, and/or metabolic syndrome as described 
above » 

In one embodiment of the invention, administration of DG1 53 nucleic acids 
and proteins and/or effectors/modulators thereof in a pharmaceutical 
composition to a subject in need thereof, particularly a human patient, 
leads to an at least partial regeneration of, for example, pancreas cells. The 
composition will then at least partially restore normal pancreatic function. 
In one example, these cells are beta cells of the islets which then start 
producing insulin on their own. After administration of this composition 
e.g. on a regular basis, the beta cells of a diabetic subject will grow back 
CO approach the nomial size and number present in a nondiabetic subject. 
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Thls effect upon the body reverses the condition of diabetes. As the 
subject's blood sugar level improves, the dosage administered may be 
reduced in strength. In at least some cases further administration can be 
discontinued entirely and the subject continues to produce a normal 
amount of insulin without further treatment. The subject is thereby not 
only treated but cured entirely of a diabetic condition. Further, beta cells or 
precursors thereof may be treated in vitro and implanted or reimplanted 
into a subject in need thereof. Furthermore, other cells of the pancreas can 
be regenerated in vivo and/or in vrtro to cure a certain condition. 

Beside diabetes, the compositions of the present invention will also have 
efficacy for treatment of patients with other pancreatic diseases such 
pancreatic cancer, dysplasia or pancreatitis. 

The DG1 53 nucleic acids and proteins and effectors/modulators thereof are 
useful in diagnostic and therapeutic applications implicated in various 
embodiments as described below. For example, but not limited to, cDNAs 
encoding the proteins of the invention and particularly their human 
homologues may be useful in gene therapy, and the proteins of the 
invention and particularly their human homologues may be useful when 
administered to a subject in need thereof. By way of non-limiting example, 
the compositions of the present invention will have efficacy for treatment 
of patients suffering from, for example, pancreatic diseases (e.g. diabetes), 
obesity, and/or metabolic syndrome as described above. 

The nucleic acids of the invention or fragments thereof, may further be 
useful in diagnostic applications, wherein the presence or amount of the 
nucleic acids or the proteins are to be assessed. Further antibodies that 
bind immunospeciflcally to the novel substances of the invention may be 
used in therapeutic or diagnostic methods. 
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For example, in one aspect, antibodies, wliich are specific for the proteins 
of the invention and homologous proteins, may be used directly as an 
effector/modulator, e.g. an antagonist or an agonist, or indirectly as a 
targeting or delivery mechanism for bringing a pharmaceutical agent to 
cells or tissue which express the protein. The antibodies may be generated 
using methods that are well known in the art. Such antibodies may include, 
but are not limited to, polyclonal, monoclonal, chimeric single chain. Fab 
fragments, and fragments produced by a Fab expression library. 
Neutralising antibodies, (i.e., those which inhibit dimer formation) are 
especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, humans, and others, may be immunized by injection with the 
protein or any fragment or oligopeptide thereof which has immunogenic 
properties. Depending on the host species, various adjuvants may be used 
to increase immunological response. It is preferred that the peptides, 
fragments or oligopeptides used to induce antibodies to the protein have an 
amino acid sequence consisting of at least five amino acids, and more 
preferably at least 10 amino acids. 

Monoclonal antibodies to the proteins may be prepared using any 
technique that provides for the production of antibody molecules by 
continuous cell lines in culture. These include, but are not limited to, the 
hybridoma technique, the human B-cell hybridoma technique, and the 
EBV-hybridoma technique (K6hler, G. et al. (1975) Nature 256:495-497; 
Kozbor, D. et al. (1985) J. Immunol. Methods 81:31-42; Cote, R. J. et al. 
Proc. Nati. Acad. Sci. 80:2026-2030; Cole, S. P. et al. (1984) Mol. Cell 
Biol. 62:109-120). 

In addition, techniques developed for the production of 'chimeric 
antibodies', the splicing of mouse antibody genes to human antibody genes 
to obtain a molecule with appropriate antigen specificity and biological 
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activity can ba used (Morrison, S. L. et al. (1984) Proc. Natl. Acad. Sci. 
81:6851-6855; Neuberger, M. S. et al (1984) Nature 312:604-608; 
Takeda, S. et al. (1985) Nature 314:452-454). Alternatively, techniques 
described for the production of single chain antibodies may be adapted, 
using methods known in the art, to produce single chain antibodies specific 
for the proteins of the invention and homologous proteins. Antibodies with 
related specificity, but of distinct idiotypic composition, may be generated 
by chain shuffling from random combinatorial immunoglobulin libraries 
(Burton, D. R. (1991) Proc. Natl. Acad. Sci. 88:1 1 120-1 11 23). Antibodies 
may also be produced by inducing in vivo production in the lymphocyte 
population or by screening recombinant immunoglobulin libraries or panels 
of highly specific binding reagents as disclosed In the literature (Orlandi, R. 
etal. (1989) Proc. Natl. Acad. Sci. 86:3833-3837; Winter, G. etal. (1991) 
Nature 349:293-299). 

Antibody fragments, which contain specific binding sites for the proteins 
may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by Pepsin 
digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of F(ab')2 fragments. 
Alternatively, Fab expression libraries may be constructed to allow rapid 
and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse, W. D. et al. (1989) Science 254:1275-1281). 

Various immunoassays may be used for screening to identify antibodies 
having the desired specificity. Numerous protocols for competitive binding 
and immunoradiometric assays using either polyclonal or monoclonal 
antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation 
between the protein and its specific antibody. A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to 
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two non-interfering protein epitopes are preferred, but a competitive 
binding assay may also be employed (Maddox, supra). 

In another embodiment of the invention, the polynucleotides or fragments 
thereof or nucleic acid effector/modulator molecules such as antisense 
molecules, aptamers, RNAi molecules or ribozymes may be used for 
therapeutic purposes. In one aspect, aptamers, i.e. nucleic acid molecules, 
which are capable of binding to a protein of the invention and modulating 
its activity, may be generated by a screening and selection procedure 
involving the use of combinatorial nucleic acid libraries. 

In a further aspect, antisense molecules may be used in situations in which 
it would be desirable to block the transcription of the mRNA. In particular, 
cells may be transformed with sequences complementary to 
polynucleotides encoding DG153 and homologous proteins. Thus, 
antisense molecules may be used to modulate / effect protein activity or to 
achieve regulation of gene function. Such technology is now well known in 
the art, and sense or antisense oligomers or larger fragments, can be 
designed from various locations along the coding or control regions of 
sequences encoding the proteins. Expression vectors derived from 
retroviruses, adenovirus, herpes or vaccinia viruses or from various 
bacterial plasmids may be used for delivery of nucleotide sequences to the 
targeted organ, tissue or cell population. Methods, which are well known 
to those skilled in the art, can be used to construct recombinant vectors, 
which will express antisense molecules complementary to the 
polynucleotides of the genes encoding the proteins of the invention and 
homologous proteins. These techniques are described both in Sambrook et 
al. (supra) and in Ausubel et al. (supra). Genes encoding the proteins of the 
invention and homologous proteins can be turned off by transforming a cell 
or tissue with expression vectors, which express high levels of 
polynucleotides that encode the proteins of the invention and homologous 
proteins or fragments thereof- Such constructs may be used to introduce 
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untranslatable sense or antisense sequences into a cell. Even in the 
absence of integration into the DNA, such vectors may continue to 
transcribe RNA molecules until they are disabled by endogenous nucleases. 
Transient expression may last for a month or more with a non-replicating 
vector and even longer if appropriate replication elements are part of the 
vector system. 

As mentioned above, modifications of gene expression can be obtained by 
designing antisense molecules, e.g. DNA, RNA or nucleic acid analogues 
such as PNA, to the control regions of the genes encoding DG153 and 
homologous proteins, i.e., the promoters, enhancers, and introns. 
Oligonucleotides derived from the transcription initiation site, e.g., between 
positions -10 and +10 from the start site, are preferred. Similarly, 
inhibition can be achieved using "triple helix" base-pairing methodology. 
Triple helix pairing is useful because it cause inhibition of the ability of the 
double helix to open sufficiently for the binding of polymerases, 
transcription factors or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature (Gee, J. E. et al. 
(1994) In; Huber, B. E. and B. I. Carr, Molecular and Immunologic 
Approaches, Futura Publishing Co., Mt. Kisco, N.Y.). The antisense 
molecules may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the 
specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Examples, which may 
be used, include engineered hammerhead motif ribozyme molecules that 
can be specifically and efficiently catalyze endonucleolytic cleavage of 
sequences encoding the proteins of the invention and homologous 
proteins. Specific ribozyme cleavage sites within any potential RNA target 
are initially identified by scanning the target molecule for ribozyme 
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cleavage sites which include the following sequences: GUA, GUU, and 
GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides corresponding to the region of the target gene containing 
the cleavage site may be evaluated for secondary structural features which 
may render the oligonucleotide inoperable. The suitability of candidate 
targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Nucleic acid effector/modulator molecules, e.g. antisense molecules and 
ribozymes may be prepared by any method known in the art for the 
synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be 
generated by in vitro and in vivo transcription of DNA sequences. Such 
DNA sequences may be incorporated into a variety of vectors with suitable 
RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
constructs that synthesize antisense RNA constitutively or inducibly can be 
introduced into cell lines, cells or tissues. RNA molecules may be modified 
to increase intracellular stability and half-life. Possible modifications 
include, but are not limited to, the addition of flanking sequences at the 5' 
and/or 3' ends of the molecule or modifications in the nucleobase, sugar 
and/or phosphate moieties, e.g. the use of phosphorothioate or 2' 
0-methyl rather than phosphodiesterase linkages within the backbone of 
the molecule. This concept is inherent in the production of PNAs and can 
be extended in all of these molecules by the inclusion of non-traditional 
bases such as inosine, queosine, and wybutosine, as well as acetyl-, 
methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, 
thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

Many methods for introducing vectors into cells or tissues are available and 
equally suitabJe for use in vivo, in vitro, and e:c vivo. For e:i vivo therapy. 
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vectors may be introduced into stem cells talcen from the patient and 
clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection and by liposome injections may be achieved using 
methods, which are well known In the art. Any of the therapeutic methods 
described above may be applied to any suitable subject including, for 
example, mammals such as dogs, cats, cows, horses, rabbits, monkeys, 
and most preferably, humans. 

An additional embodiment of the invention relates to the administration of 
a pharmaceutical composition, in conjunction with a pharmaceutically 
acceptable carrier, for any of the therapeutic effects discussed above. 
Such pharmaceutical compositions may consist of DG153 nucleic acids 
and the proteins and homologous nucleic acids or proteins, antibodies to 
the proteins of the invention and homologous proteins, mimetlcs, agonists, 
antagonists or inhibitors of the proteins of the invention and homologous 
proteins or nucleic acids. The compositions may be administered alone or 
in combination with at least one other agent, such as stabilizing 
compound, which may be administered in any sterile, biocompatible 
pharmaceutical carrier, including, but not limited to, saline, buffered saline, 
dextrose, and water. The compositions may be administered to a patient 
alone or in combination with other agents, drugs or hormones. The 
pharmaceutical compositions utilized in this invention may be administered 
by any number of routes including, but not limited to, oral, intravenous, 
intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, 
transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions 
may contain suitable pharmaceutically-acceptable carriers comprising 
excipients and auxiliaries, which facilitate processing of the active 
compounds into preparations, which can be used pharmaceutically. Further 
details on techniques for formulation and administration may be found in 
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the latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co., Easton, Pa.). 

Pharmaceutical compositions suitable for use in the invention include 
compositions wherein the active ingredients are contained in an effective 
amount to achieve the intended purpose. The determination of an effective 
dose is well within the capability of those skilled in the art. For any 
compounds, the therapeutically effective dose can be estimated initially 
either in cell culture assays, e.g., of preadipocyte cell lines or in animal 
models, usually mice, rabbits, dogs or pigs. The animal model may also be 
used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful 
doses and routes for administration in humans. A therapeutically effective 
dose refers to that amount of active ingredient, for example the DG153 
nucleic acids or proteins or fragments thereof or antibodies, which is 
sufficient for treating a specific condition. Therapeutic efficacy and toxicity 
may be determined by standard pharmaceutical procedures in cell cultures 
or experimental animals, e.g., ED50 (the dose therapeutically effective in 
50% of the population) and LD50 (the dose lethal to 50% of the 
population). The dose ratio between therapeutic and toxic effects is the 
therapeutic index, and it can be expressed as the ratio, LD50/ED50. 
Pharmaceutical compositions, which exhibit large therapeutic indices, are 
preferred. The data obtained from cell culture assays and animal studies is 
used in formulating a range of dosage for human use. The dosage 
contained in such compositions is preferably within a range of circulating 
concentrations that include the ED50 with little or no toxicity. The dosage 
varies within this range depending upon the dosage from employed, 
sensitivity of the patient, and the route of administration. The exact dosage 
will be determined by the practitioner, in light of factors related to the 
subject that requires treatment. Dosage and administration are adjusted to 
orovide sufficient levels of the active moiety or to maintain the desired 
dV-fect. Factors^, which may be taken into account, include. tht3 severity of 
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the disease state, general health of the subject, age, weight, and gender of 
the subject, diet, time and frequency of administration, drug 
combination(s), reaction sensitivities, and tolerance/response to therapy. 
Long-acting pharmaceutical compositions may be administered every 3 to 
4 days, every week or once every two weeks depending on half-life and 
clearance rate of the particular formulation. Normal dosage amounts may 
vary from 0.1 to 100,000 microg, up to a total dose of about 1 g, 
depending upon the route of administration. Guidance as to particular 
dosages and methods of delivery is provided in the literature and generally 
available to practitioners in the art. Those skilled in the art employ different 
formulations for nucleotides than for proteins or their inhibitors. Similarly, 
delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 

In another embodiment, antibodies which specifically bind to the proteins 
may be used for the diagnosis of conditions or diseases characterized by or 
associated with over- or underexpression of the proteins of the invention 
and homologous proteins or in assays to monitor patients being treated 
with the proteins of the invention and homologous proteins, or effectors / 
modulators thereof, e.g. agonists, antagonists, or inhibitors. Diagnostic 
assays include methods which utilize the antibody and a label to detect the 
protein in human body fluids or extracts of cells or tissues. The antibodies 
may be used with or without modification, and may be labeled by joining 
them, either covaleritly or non-covalently, with a reporter molecule. A wide 
variety of reporter molecules, which are known in the art may be used 
several of which are described above. 

A variety of protocols including ELISA, RIA, and FACS for measuring 
proteins are known in the art and provide a basis for diagnosing altered or 
abnormal levels of gene expression. Normal or standard values for gene 
expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, preferably human, with antibodies to the 
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protein under conditions suitable for complex formation. The amount of 
standard complex formation may be quantified by various methods, but 
preferably by photometric means. Quantities of protein expressed in control 
and disease, samples e.g. from biopsied tissues are compared with the 
standard values. Deviation between standard and subject values 
establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides specific for 
the DG153 proteins and homologous proteins may be used for diagnostic 
purposes. The polynucleotides, which may be used, include oligonucleotide 
sequences, antisense RNA and DNA molecules, and PNAs. The 
polynucleotides may be used to detect and quantitate gene expression in 
biopsied tissues in which gene expression may be correlated with disease. 
The diagnostic assay may be used to distinguish between absence, 
presence, and excess gene expression, and to monitor regulation of protein 
levels during therapeutic intervention. 

In one aspect, hybridization with probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding the 
proteins of the invention and homologous proteins or closely related 
molecules, may be used to identify nucleic acid sequences which encode 
the respective protein. The hybridization probes of the subject invention 
may be DNA or RNA and are preferably derived from the nucleotide 
sequence of the polynucleotide encoding the proteins of the invention or 
from a genomic sequence including promoter, enhancer elements, and 
• iritrons of the naturally occurring gene. Hybridization probes may be labeled 
by a variety of reporter groups, for example, radionuclides such as ^^P or 
^^S or enzymatic labels, such as alkaline phosphatase coupled to the probe 
via avidln/biotin coupling systems, and the like. 

Polynucleotide sequences specific for DG153 proteins and homologous 
nucleic acids may be used for the diagnosis of conditions or diseases. 



- 30- 

which are associated with the expression of the proteins. Examples of such 
diseases include the pancreatic diseases (e.g. diabetes), obesity, metabolic 
syndrome, and/or others. Polynucleotide sequences specific for the DG1 53 
proteins and homologous proteins may also be used to monitor the 
progress of patients receiving treatment for pancreatic diseases (e.g. 
diabetes), obesity, and/or metabolic syndrome. The polynucleotide 
sequences may be used qualitative or quantitative assays, e.g. in Southern 
or Northern analysis, dot blot or other membrane-based technologies; in 
PGR technologies; or in dip stick, pin, ELISA or chip assays utilizing fluids 
or tissues from patient biopsies to detect altered gene expression. 

In a particular aspect, the DQ153 nucleotide sequences may be useful In 
assays that detect activation or induction of various metabolic diseases or 
dysfunctions. The nucleotide sequences may be labeled by standard 
methods, and added to a fluid or tissue sample from a patient under 
conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is 
quantitated and compared with a standard value. The presence of altered 
levels of nucleotide sequences encoding the proteins of the invention and 
homologous proteins in the sample indicates the presence of the 
associated disease. Such assays may also be used to evaluate the efficacy 
of a particular therapeutic treatment regimen in animal studies, in clinical 
trials or in monitoring the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disease associated with 
expression of the DG153 proteins and homologous proteins, a normal or 
standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either 
animal or human, with a sequence or a fragment thereof, which is specific 
for the nucleic acids encoding the proteins of the invention and 
homologous nucleic acids, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the 
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values obtained from normal subjects with those from an experiment where 
a Icnown amount of a substantially purified polynucleotide is used. 
Standard values obtained from normal samples may be compared with 
values obtained from samples from patients who are symptomatic for 
disease. Deviation between standard and subject values is used to 
establish the presence of disease. Once disease is established and a 
treatment protocol is initiated, hybridization assays may be repeated on a 
regular basis to evaluate whether the level of expression in the patient 
begins to approximate that, which is observed in the normal patient. The 
results obtained from successive assays may be used to show the efficacy 
of treatment over a period ranging from several days to months. 

With respect to pancreatic diseases (e.g. diabetes), obesity, and/or 
metabolic syndrome, the presence of an unusual amount of transcript in 
biopsied tissue from an individual may indicate a predisposition for the 
development of the disease or may provide a means for detecting the 
disease prior to the appearance of actual clinical symptoms. A more 
definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the metabolic diseases and 
disorders. 

Additional diagnostic uses for oligonucleotides designed from the 
sequences encoding the proteins of the invention and homologous proteins 
may involve the use of PGR. Such oligomers may be chemically 
synthesized, generated enzymatically or produced from a recombinant 
source. Oligomers will preferably consist of two nucleotide sequences, one 
with sense orientation (5prime.fwdarw.3prime) and another with antisense 
(Sprime.rarw.Sprime), employed under optimized conditions for 
identification of a specific gene or condition. The same two oligomers, 
nested sets of oligomers or even a degenerate pool of oligomers may be 
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empioyed under less stringent conditions for detection and/or quantification 
of closely related DNA or RNA sequences. 

In another embodiment of the invention, the nucleic acid sequences may 
also be used to generate hybridization probes, which are useful for 
mapping the naturally occurring genomic sequence. The sequences may be 
mapped to a particular chromosome or to a specific region of the 
chromosome using well known techniques. Such techniques include FISH, 
FACS or artificial chromosome constructions, such as yeast artificial 
chromosomes, bacterial artificial chromosomes, bacterial PI constructions 
or single chromosome cDNA libraries as reviewed in Price, C. M, (1993) 
Blood Rev. 7:127-134, and Trask, B. J. (1991) Trends Genet. 7:149-154. 
FISH (as described in Verma et al. (1988) Human Chromosomes: A Manual 
of Basic Techniques, Pergamon Press, New York, N-Y.). The results may 
be correlated with other physical chromosome mapping techniques and 
genetic map data. Examples of genetic map data can be found in the 1 994 
Genome Issue of Science (265:1 981f). Correlation between the location of 
the gene encoding the proteins of the Invention on a physical chromosomal 
map and a specific disease or predisposition to a specific disease, may help 
to delimit the region of DNA associated with that genetic disease. 

The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected 
individuals. An analysis of polymorphisms, e.g. single nucleotide 
polymorphisms may be carried out. Further, in situ hybridization of 
chromosomal preparations and physical mapping techniques such as 
linkage analysis using established chromosomal markers may be used for 
extending genetic maps. Often the placement of a gene on the 
chromosome of another mammalian species, such as mouse, may reveal 
associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to 
chromosomal arms or parts thereof, by physical mapping. This provides 
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valuable information to investigators searching for disease genes using 
positional cloning or other gene discovery techniques. Once the disease or 
syndrome has been crudely localized by genetic linkage to a particular 
genomic region, for example, AT to 11q22-23 (Gatti, R. A. et al. (1988) 
Nature 336:577*580), any sequences mapping to that area may represent 
associated or regulatory genes for further investigation. The nucleotide 
sequences of the subject invention may also be used to detect differences 
in the chromosomal location due to translocation, inversion, etc. among 
normal, carrier or affected individuals. 

In another embodiment of the invention, the proteins of the invention, their 
catalytic or immunogenic fragments or oligopeptides thereof, an in vitro 
model, a genetically altered cell or animal, can be used for screening 
libraries of compounds in any of a variety of drug screening techniques. 
One can identify effectors, e.g. receptors, enzymes, proteins, ligands, or 
substrates that bind to, modulate or mimic the action of one or more of the 
DG1 53 proteins of the invention. The protein or fragment thereof employed 
in such screening may be free in solution, affixed to a solid support, borne 
on a cell surface, or located intracellulary. The formation of binding 
complexes, between the DG153proteins of the invention and the agent 
tested, may be measured. Agents could also, either directly or indirectly, 
influence the activity of the proteins of the invention. 

In addition activity of the proteins of the invention against their 
physiological substrate(s) or derivatives thereof could be measured in 
cell-based or cell-free assays. Agents may also interfere with 
posttranslational modifications of the protein, such as phosphorylation and 
dephosphorylation, farnesylation, palmltoylation, acetylation, alkylation, 
ubiquitination, proteolytic processing, subcellular localization and 
degradation. Moreover, agents could influence the dimerization or 
oligomerization of the proteins of the invention or, in a heterologous 
manner, of ths proteins of the inventionr with other proteins, for example. 
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but not exclusively, docking proteins, enzymes, receptors, or translation 
factors. Agents could also act on the physical interaction of the proteins of 
this Invention with other proteins, which are required for protein function, 
for example, but not exclusively, their downstream signaling. 

Methods for determining protein-protein interaction are well known in the 
art. For example binding of a fluorescently labeled peptide derived from the 
interacting protein to the DG153 protein of the invention, or vice versa, 
could be detected by a change in polarisation. In case that both binding 
partners, which can be either the full length proteins as well as one binding 
partner as the full length protein and the other just represented as a 
peptide are fluorescently labeled, binding could be detected by 
fluorescence energy transfer (FRET) from one fluorophore to the other. In 
addition, a variety of commercially available assay principles suitable for 
detection of protein-protein Interaction are well known In the art, for 
example but not exclusively AlphaScreen (PerkinElmer) or Scintillation 
Proximity Assays (SPA) by Amersham. Alternatively, the interaction of the 
DG153 proteins of the invention with cellular proteins could be the basis 
for a cell-based screening assay, in which both proteins are fluorescently 
labeled and interaction of both proteins is detected by analysing 
cotranslocation of both proteins with a cellular imaging reader, as has been 
developed for example, but not exclusively, by Cellomics or EvotecOAI. In 
all cases the two or more binding partners can be different proteins with 
one being the protein of the invention, or in case of dimerization and/or 
ollgomerization the protein of the invention itself. 

Of particular interest are screening assays for agents that have a low 
toxicity for mammalian cells. The term "agent" as used herein describes 
any molecule, e.g. protein or pharmaceutical, with the capability of altering 
or mimicking the physiological function of one or more of the proteins of 
the invention. Candidate agents encompass numerous chemical classes, 
though typically they are organic molecules, preferably small organic 
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compounds having a molecular weight of more than 50 and less than 
about 2,500 Daltons. Candidate agents comprise functional groups 
necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or 
carboxyl group, preferably at least two of the functional chemical groups. 
The candidate agents often comprise carbocyclic or heterocyclic structures 
and/or aromatic or polyaromatic structures substituted with one or more of 
the above functional groups. 

Candidate agents are also found among biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, nucleic acids and 
derivatives, structural analogs or combinations thereof. Candidate agents 
are obtained from a wide variety of sources including libraries of synthetic 
or natural compounds. For example, numerous means are available for 
random and directed synthesis of a wide variety of organic compounds and 
biomolecules, including expression of randomized oligonucleotides and 
oligopeptides. Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural or synthetically produced libraries and 
compounds are readily modified through conventional chemical, physical 
and biochemical means, and may be used to produce combinatorial 
libraries. Known pharmacological agents may be subjected to directed or 
random chemical modifications, such as acylation, alkylation, esterification, 
amidification, etc. to produce structural analogs. Where the screening 
assay is a binding assay, one or more of the molecules may be joined to a 
label, where the label can directly or indirectly provide a detectable signal. 

Another technique for drug screening, which may be used, provides for 
high throughput screening of compounds having suitable binding affinity to 
the protein of interest as described in published PCT application 
WO84/03564. In this method, as applied to the proteins of the invention 
large numbers of different small test compounds, e-g. aptamers, peptides.. 



- 36 - 

low-molecuiar weight compounds etc., are provided or synthesized on a 
solid substrate, such as plastic pins or some other surface. The test 
compounds are reacted with the proteins or fragments thereof, and 
washed. Bound proteins are then detected by methods well known in the 
art. Purified proteins can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize It on a solid 
support. In another embodiment, one may use competitive drug screening 
assays in which neutralizing antibodies capable of binding the protein 
specifically compete with a test compound for binding the protein. In this 
manner, the antibodies can be used to detect the presence of any peptide, 
which shares one or more antigenic determinants with the protein. 

Compounds that bind DG153 proteins, e.g. antibodies, are useful for the 
identification or enrichment of cells, which are positive for the expression 
of the proteins of the invention, from complex cell mixtures. Such cell 
populations are useful in transplantation, for experimental evaluation, and 
as source of lineage and cell specific products, including mRNA species 
useful in identifying genes specifically expressed in these cells, and as 
target for the identification of factors of molecules that can affect them. 
Cells expressing the protein of the invention or which have been treated 
with the protein of the invention are useful in transplantation to provide a 
recipient with pancreatic islet cells, including insulin producing beta cells; 
for drug screening; experimental models of islet differentiation and 
interaction with other cell types; in vitro screening assays to define growth 
and differentiation factors, and to additionally characterize genes involved 
In islet development and regulation; and the like. The native cells may be 
used for these purposes, or they may be genetically modified to provide 
altered capabilities. Cells from a regenerating pancreas, from embryonic 
foregut, stomach and duodenum, or other sources of pancreatic progenitor 
cells may be used as a starting population. The progenitor cells may be 
obtained from any mammalian species, e.g. equine, bovine, porcine. 
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canine, feline, rodent, e.g. mice, rats, hamster, primate, etc. particularly 
human. 

In another embodiment, in a high-throughput screening method, the cells 
are transfected with a DNA construct, e.g. a viral or non-viral vector 
containing a reporter gene, e.g. the lacZ gene or the GFP gene, under 
regulatory control of a promoter of a gene involved in for example beta-cell 
differentiation, e.g. a promoter of a gene stimulation beta-cell 
differentiation, preferably a Pax4 promoter. The transfected cells are 
divided into aliquots and each aliquot is contacted with a test substance, 
e.g., candidate 1, candidate 2 and candidate 3. The activity of the 
reporter gene corresponds to the capability of the test compound to induce 
beta-cell differentiation. 

In a further embodiment, which may be combined with the high-throughput 
screening as described above, a medium throughput validation is carried 
out. Therein, the test compound is added to stem cells being cultivated and 
the insulin production is determined. Following an initial high throughput 
assay, such as the cell based assay outlined above where for example a 
Pax4 promoter is used as marker for beta-cell regeneration, the activity of 
candidate molecules to induce beta-cell differentiation is tested in a 
validation assay comprising adding said compounds to the culture media of 
the embryoid bodies. Differentiation into insulin-producing cells is then 
evaluated, e.g. by comparison to wild type and/or Pax4 expressing ES cells 
to assess the effectiveness of a compound. 

The nucleic acids encoding the DG153 proteins of the invention can be 
used to generate transgenic cell lines and animals. These transgenic 
non-human animals are useful in the study of the function and regulation of 
the proteins of the invention in vivo. Transgenic animals, particularly 

mammalian transgenic animals, can serve as a model system for the 
investigation of many dei'elopmantal and cellular processas common to 
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humans. A variety of non-human models of metabolic disorders can be 
used to test modulators of th& protein of the invention. Misexpression (for 
example, overexpression or lack of expression) of the protein of the 
invention, particular feeding conditions, and/or administration of 
biologically active compounts can create models of metablic disorders. 

In one embodiment of the invention, such assays use mouse models of 
insulin resistance and/or diabetes, such as mice carrying gene knockouts in 
the leptin pathway (for example, ob (leptin) or db (leptin receptor) mice), as 
described above. In addition to testing the expression of the proteins of the 
invention in such mouse strains (see EXAMPLES), these mice could be 
used to test whether administration of a candidate modulator alters for 
example lipid accumulation in the liver, in plasma, or adipose tissues using 
standard assays well known in the art, such as FPLC, colorimetric assays, 
blood glucose level tests, insulin tolerance tests and others. 

Transgenic animals may be made through homologous recombination in 
embryonic stem cells, where the normal locus of the gene encoding the 
protein of the invention is mutated. Alternatively, a nucleic acid construct 
encoding the protein is injected into oocytes and is randomly integrated 
into the .genome. One may also express the genes of the invention or 
variants thereof in tissues where they are not normally expressed or at 
abnormal times of development. Furthermore, variants of the genes of the 
invention like specific constructs expressing anti-sense molecules or 
expression of dominant negative mutations, which will block or alter the 
expression of the proteins of the invention may be randomly integrated into 
the genome. A detectable marker, such as lac Z or lucif erase may be 
introduced into the locus of the genes of the invention, where upregulation 
of expression of the genes of the invention will result in an easily 
detectable change in phenotype. Vectors for stable integration include 
plasmids, retroviruses and other animal viruses, yeast artificial 
chromosomes (YACs), and the like. DNA constructs for homologous 
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recombination will contain at least portions of the genes of the invention 
with the desired genetic modification, and will include regions of homology 
to the target locus. Conveniently, markers for positive and negative 
selection are included. DNA constructs for random integration do not need 
to contain regions of homology to mediate recombination. DNA constructs 
for random integration will consist of the nucleic acids encoding the 
proteins of the invention, a regulatory element (promoter), an intron and a 
poly-adenylation signal. Methods for generating cells having targeted gene 
modifications through homologous recombination are known in the field. 
For embryonic stem (ES) cells, an ES cell line may be employed, or 
embryonic cells may be obtained freshly from a host, e.g. mouse, rat, 
guinea pig, etc. Such cells are grown on an appropriate fibroblast-feeder 
layer and are grown in the presence of leukemia inhibiting factor (LIF). ES 
or embryonic cells may be transfected and can then be used to produce 
transgenic animals. After transf action, the ES cells are plated onto a feeder 
layer in an appropriate medium. Cells containing the construct may be 
selected by employing a selection medium. After sufficient time for 
colonies to grow, they are picked and analyzed for the occurrence of 
homologous recombination. Colonies that are positive may then be used for 
embryo manipulation and morula aggregation. Briefly, morulae are obtained 
from 4 to 6 week old superovulated females, the Zona Pellucid a is removed 
and the morulae are put into small depressions of a tissue culture dish. The 
ES cells are trypsinized, and the modified cells are placed into the 
depression closely to the morulae. On the following day the aggregates are 
transfered into the uterine horns of pseudopregnant females. Females are 
then allowed to go to term. Chimeric offsprings can be readily detected by 
a change in coat color and are subsequently screened for the transmission 
of the mutation into the next generation (F1 -generation). Offspring of the 
F1 -generation are screened for the presence of the modified gene and 
males and females having the modification are mated to produce 
homozygous progeny. If the gene alterations cause lethality at some point 
in development, tissues or organs can be maintained as allogenic or 
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congenic grafts or transplants, or in vitro culture. The transgenic animals 
may be any non-human mammal, such as laboratory animal, domestic 
animals, etc., for example, mouse, rat, guinea pig, sheep, cow, pig, and 
others. The transgenic animals may be used in functional studies, drug 
screening, and other applications and are useful in the study of the 
function and regulation of the proteins of the invention in vivo. 

Finally, the invention also relates to a kit comprising at least one of 

(a) a nucleic acid molecule coding for a protein of the invention or a 
functional fragment thereof; 

(b) a protein of the invention or a fragment or an isoform thereof; 

(c) a vector comprising the nucleic acid of <a); 

(d) a host cell comprising the nucleic acid of (a) or the vector of (b); 

(e) a polypeptide encoded by the nucleic acid of (a); 

(f) a fusion polypeptide encoded by the nucleic acid of (a); 

(g) an antibody, an aptamer or another effector / modulator against the 
nucleic acid of (a) or the polypeptide of (b), (e) or (f) and 

(h) an anti-sense oligonucleotide of the nucleic acid of (a). 

The kit may be used for diagnostic or therapeutic purposes or for screening 
applications as described above. The kit may further contain user 
instructions. 



The Figures show: 

Figure 1 shows human DG153 nucleic acid and proteins 

Figure 1 A shows the nucleic acid sequence of human DG153 protein (SEQ 

ID NO: 1). 

Figure IB shows the amino acid sequence (one-letter code) of human 
DG153 protein, longer variant (SEQ ID NO: 2). 

Figure 1C shows the amino acid sequence (one-letter code) of human 
DG153 protein, shorter variant (SEQ ID NO: 3). 
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Figure 2 shows the analysis of DG153 protein expression in mammalian 
tissues. The relative RNA-expression is shown on the Y-axis, in Figure 2A 
to 2C the tissues tested are given on the X-axis. WAT refers to white 
adipose tissue, BAT refers to brown adipose tissue. In Figure 2D, 2E, and 
2F, the X-axis represents the time axis. 'dO' refers to day O (start of the 
experiment), "dIO" refers to day 10 of adipocyte differentiation, "d12" 
refers to day 12 of adipocyte differentiation). 

Figure 2A shows the quantitative analysis of DG153 expression in mouse 
wild-type tissues. 

Figure 2B shows the quantitative analysis of DG153 expression in 
wild-type mice (wt-mice), compared to genetically obese mice (ob/ob-mice) 
and to fasted mice (fasted-mice). 

Figure 2C shows the quantitative analysis of DGl 53 expression in mice fed 
with a control diet compared to mice fed with a high fat diet. 
Figure 2D shows the quantitative analysis (taqman) of DGl 53 expression 
in mammalian fibroblast (3T3-L1) cells, during the differentiation from 
preadipocytes to mature adipocytes. 

Figure 2E shows the quantitative analysis (microarray) of DGl 53 
expression in human abdominal adipocyte cells, during the differentiation 
from preadipocytes to mature adipocytes. 

Figure 2F shows the quantitative analysis of DGl 53 expression in human 
abdominal adipocyte cells, during the differentiation from preadipocytes to 
mature adipocytes. 

The examples illustrate the invention: 

Example 1 : Identification of secreted factors expressed in pancreas 

A screen for secreted factors expressed In developing mouse pancreas was 
carried oul^according to methods known by those skilled in the art (see, for 
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example Pera E.M. and De Robertis E.M.. (2000) Mech Dev 96(2): 
183-195) with several modifications. 

Expression cDNA library: 

A mouse embryonic stage 9.5-15 pancreatic bud library was prepared in 
pCMVSPORT-6 vector using SUPERSCRIPT Plasmid System from 
Invitrogen according to the manufacturer's instructions. The non-amplified 
library was electroporated into MaxEff DH10B cells (Invitrogen). 

Secretion cloning 

Bacterial clones were picked with sterile toothpicks from agar plates and 
cultured in 96-deep-well microtiter plates in LB-ampicillin (see Sambrooket 
al., supra). Aliquots of 8 cultures were pooled, and plasmid DNAwas 
isolated using the BioRobot_9600 apparatus according to the 
manufacturer's instructions (Qiagen; QIAprep(r) Turbo BioRobot Kit. 
Human 293 cell culture cells were cultured in 75 ml tissue culture flasks in 
DMEM and 10% fetal calf serum. At 90-99% confluence, the cells were 
splitted at 1 :3 ratio and plated onto poly-D-lysine (Sigma) coated 96-well 
plates. Cells were transfected with 100-500 ng plasmid using 
lipofectamine 2000 (Invitrogen). After 6 hours, the medium was 
exchanged for fresh complete growth medium. 24 hours after transfection, 
the cells were washed twice with DMEM without cysteine and methionine 
(Invitrogen), supplemented with 1 % dialysed Bovine serum (Sigma) with 
50 microgram per ml Heparin (Sigma) and glutamine. The cells were 
labeled radioactively ('S35 Met-label', from Hartmann Analytic GmbH). 
After 1 2 hours, aliquots of the supernatants were harvested in 96-well PCR 
plates and subjected to SDS gel electrophoresis in precast 4 ±20% 
gradient polyacrylamide Criterion gels (Biorad) under reducing conditions, 
using Criterion Dodeca Cell gel running chamber (Biorad). The gels were 
fixed in 10% acetic acid, 25% isopropanol for 30 min, soaked 15-30 min 
in AMPLIFY reagent (Amersham), dried and exposed to X-OMAT (AR) film 
(Kodak). Positive clones were identified and regrown in 96-well-plates. 
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DNA of individual clones was prepared and used for transfection as 
described above. If one of the clones yielded proteins of the same size as 
that of the original pool, a positive clone was identified. Positive clones 
were partially sequenced from the 5' end (SEQLAB, Goettingen). 

Example 2: Identification of the human DG153 homologous nucleic acid 
and proteins (FIGURE 1 ) 

The term "polynucleotide comprising the nucleotide sequence as shown in 
GenBank Accession number" relates to the expressible gene of the 
nucleotide sequences deposited under the corresponding GenBank 
Accession number. The term "GenBank Accession number" relates to NCBI 
GenBank database entries (Ref.: Benson et al.. Nucleic Acids Res. 28 
(2000) 15-18). 

DG153 homologous proteins and nucleic acid molecules coding therefore 
are obtainable from insect or vertebrate species, e.g. mammals or birds. 
Particularly preferred are nucleic acids comprising human DG153 
homologs. The following mouse sequence was identfied in the secreted 
factor screen': Mus musculus arginine-rich, mutated in early stage tumors 
(Armet), GenBank Accession Number NM_029103 (838 base pairs mRNA) 
and GenBank Accession Number NP_083379" (179 amino acid protein)). 
Sequences homologous to mouse DG1 53 were identified using the publicly 
available program BLASTP 2.2.3 of the non-redundant protein data base of 
the National Center for Biotechnology Information (NCBI) (see, Altschul et 
al., 1997, Nucleic Acids Res. 25:3389-3402). The best human homolog of 
mouse DG1 53 is Homo sapiens arginine-rich protein (ARP) gene GenBank 
Accession Number M83751 (1103 base pairs mRNA; SEQ ID NO: 1; see 
FIGURE la) and GenBank Accession Number AAB08753.1 (234 amino acid 
protein; SEQ ID NO: 2; see FIGURE lb) and Swisspro Acession Number 
P55145 (179 amino acid protein; SEQ ID IMO: 3; see FIGURE 1c). The 234 
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amino acid protein is a longer sequence with arginine-rich regions thought 
to be the target of cancer-causing mutation. 

Example 3: Expression of the polypeptides in mammalian tissues (FIGURE 
2) 

To analyse the expression of the polypeptides disclosed in this invention in 
mammalian tissues, several mouse strains (preferably mice strains 
C57BI/6J, C57BI/6 ob/ob and. C57BI/KS db/db which are standard model 
systems in obesity and diabetes research) were purchased from Harlan 
Winkelmann (33178 Borchen, Germany) and maintained under constant 
temperature (preferably 22° C), 40 per cent humidity and a light / dark 
cycle of preferably 14/10 hours. The mice were fed a standard chow (for 
example, from ssniff SpezialitSten GmbH, order number ssniff M-Z 
V1 126-000). For the fasting experiment ("fasted wild-type mice"), 
wild-type mice were starved for 48 h without food, but only water supplied 
ad libitum (see, for example, Schnetzler et al. J Clin Invest 1993 
Jul;92(1):272-80, Mizuno et al. Proc Natl Acad Sci U S A 1996 Apr 
16;93(8):3434-8). Animals were sacrificed at an age of 6 to 8 weeks. The 
animal tissues were isolated according to standard procedures known to 
those skilled in the art, snap frozen in liquid nitrogen and stored at -80° C 
until needed. 

For analyzing the role of the proteins disclosed in this invention in the in 
vitro differentiation of different mammalian cell culture cells for the 
conversion of pre-adipocytes to adipocytes, mammalian fibroblast (3T3-L1 ) 
cells (e.g.. Green & Kehinde, Cell 1: 113-116, 1974) were obtained from 
the American Tissue Culture Collection (ATCC, Hanassas, VA. USA; 
ATCC- CL 173). 3T3-L1 cells were maintained as fibroblasts and 
differentiated into adipocytes as described in the prior art (e.g., Qiu. et al., 
J. Biol. Chem. 276:11988-95, 2001; Slieker et al., BBRC 251: 225-9, 
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1998). In brief, cells were plated in DMEM/10% FCS (Invitrogen, 
Karlsruhe, Germany) at 50,000 cells/well In duplicates in 6-well plastic 
dishes and cultured in a humidified atmosphere of 5% C02 at 37° C. At 
confluence {defined as day O: dO) cells were transferred to serum-free (SF) 
medium, containing DMEM/HamF12 (3:1; Invitrogen), fetuin (300 //g/ml; 
Sigma, Munich, Germany), transferrin (2>i/g/ml; Sigma), pantothenate (17 
^M; Sigma), biotin (1 //M; Sigma), and EGF (O.SnM; Hoffmann-La Roche, 
Basel, Switzerland). Differentiation was induced by adding dexamethasone 
(DEX; 1 /M; Sigma), 3-methyl-isobutyl-1-methylxanthine {MIX; O.SmM; 
Sigma), and bovine insulin {5iwg/ml; Invitrogen). Four days after confluence 
{d4), cells were kept in SF medium, containing bovine insulin (5 pg/ml) until 
differentiation was completed. At various time points of the differentiation 
procedure, beginning with day 0 {day of confluence) and day 2 (hormone 
addition; for example, dexamethasone and 3-isobutyl-1-methylxanthine), up 
to 10 days of differentiation, suitable aliquots of cells were taken every 
two days. 

RNA was isolated from mouse tissues or cell culture cells using Trizol 
Reagent (for example, from Invitrogen, Karlsruhe, Germany) and further 
purified with the RNeasy Kit (for example, from Qiagen, Germany) in 
combination with an DNase-treatment according to the instructions of the 
manufacturers and as known to those skilled in the art. Total RNA was 
reverse transcribed (preferably using Superscript II RNaseH- Reverse 
Transcriptase, from Invitrogen. Karlsruhe, Germany) and subjected to 
Taqman analysis preferably using the Taqman 2xPCR Master Mix (from 
Applied Biosystems, Weiterstadt, Germany; the Mix contains according to 
the Manufacturer for example AmpliTaq Gold DNA Polymerase, AmpErase 
UNG, dNTPs with dUTP, passive reference Rox and optimized buffer 
components) on a GeneAmp 5700 Sequence Detection System (from 
Applied Biosystems, Weiterstadt, Germany). 
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The following prime/probe pairs were used for the TaqK/lan analysis 
(GenBank Accession Number NM_029103 for the mouse DG153 
sequence): Mouse DG315 forward primer {Seq ID N0:4): 5'- AGA GAA 
TCG GTT GTQ CTA CTA CATTG -3'; mouse DG315 reverse primer (Seq 
ID NO:5): 5'- GGC TTC GAC ACC TCA TTG ATG -3'; mouse DG315 
Taqman probe (Seq ID NO:6): (5/6-FAM) AGO CAC AGA TGA TGC TGC 
CAC CAA (5/6-TAMRA). 

The function of the mammalian DQ1 53 in metabolism was further validated 
by analyzing the expression of the transcripts in different tissues and by 
analyzing the role in adipocyte differentiation. 

Expression profiling studies confirm the particular relevance of DG1 53 as 
regulator of energy metabolism in mammals. Taqman analysis revealed that 
DG1 53 is expressed in several mammalian tissues, with highest expression 
levels in testis in wild type mice. In addition, DG1 53 is highly expressed in 
metabolic active tissue such as white adipose tissue (WAT) and at lower 
levels in brown adipose tissue (BAT) compared to other tissue types in wild 
type mouse as depicted in Figure 2A. 

We used mouse models of insulin resistance and/or diabetes, such as mice 
carrying gene knockouts in the leptin pathway (for example, ob/ob (leptin) 
or db (leptin receptor/ligand) mice) to study the expression of DG1 53. Such 
mice develop typical symptoms of diabetes, show hepatic lipid 
accumulation and frequently have increased plasma lipid levels (see 
Bruning et al, 1998, Mol. Cell. 2:449-569). We found, for example, that 
the expression of DG153 is downregulated in metabolic active tissue 
(WAT) in genetically induced obese mice (ob/ob) compared to wild type 
mice (see Figure 2B). 



Expression of DG153 mRNA was also examined in susceptible wild type 
mice (for example, C57BI/6) that show symptoms of diabetes, lipid 
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accumulation, and high plasma lipid levels, if fed a high fat diet. In those 
mice, the expression of DQ153 is significantly downregulated in WAT and 
upregulated in BAT and muscle supporting that DQ153 is involved in the 
regulation of rnammalian metabolism (see Figure 2C). 

We show in this invention (see Examples and Figure 2D) that the DG153 
protein is expressed and decreased during the differentiation into mature 
adipocyctes. Therefore, the DG153 protein might play an essential role in 
adipogenesis. 

Example 4, Analysis of the differential expression of transcripts of the 
proteins of the invention in human tissues 

RNA preparation from human primary adipose tissues was done as 
described in EXAMPLE 2. The hybridization and scanning was performed 
as described in the manufactures manual (see Affymetrix Technical 
Manual, 2002, obtained from Affmetrix, Santa Clara, USA). 

The expression analysis (using Affymetrix GeneChips) of the DG153 gene 
using primary human abdominal adipocycte differentiation clearly shows 
. differential expression of human DG153 in adipocytes. Two independent 
experiments were done. Both experiments show that the DG153 
transcripts are the most abundance at day O compared to day 1 2 during 
differentiation (fig. 3e). These data further confirm the mouse 3T3L-1 
differentiation data. 

Thus, the DG153 protein has to be slightly decreased in order for the 
preadipocyctes to differentiate into mature adipocycte. The DG1 53 protein 
In preadipocyctes has the potential to inhibit adipose differentiation. 
Therefore, the-D.G153 protein might play an essential role in the regulation 
of human mstebolism, in particular in the regulation of adipogenesis and 
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thus It might be an essential role in pancreatic diseases (e.g. diabetes), 
obesity, and/or metabolic syndrome. 

For the purpose of the present invention, it will understood by the person 
having average skill in the art that any combination of any feature 
mentioned throughout the specification is explicitly disclosed herewith. 
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Claims 

A pharmaceutical composition comprising a nucleic acid molecule 
encoding DG1 53 protein and/or a functional fragment thereof and/or 
an effector/modulator of said nucleic acid molecule and/or said 
protein or protein fragment. 

The composition of claim 1, wherein the compositon contains 
pharmaceutically acceptable carriers, diluents, and / or additives. 

The composition of claim 1 or 2, wherein the nucleic acid molecule 
is a mammalian DGl 53 nucleic acid, particulary encoding the human 
DG153 polypeptide and/or a nucleic molecule which is 
complementary thereto or a fragment thereof or a variant thereof. 

The composition of any one of claims 1 to 3 , wherein said nucleic 
acid molecule is selected from the group consisting of 

(a) a nucleic acid molecule encoding a polypeptide as shown in 
SEQ ID NO: 2 or 3, or an isoform, fragment or variant of the 
polypeptide as shown in SEQ ID NO: 2 or 3; 

(b) a nucleic acid molecule which comprises or is the nucleic acid 
molecule as shown in SEQ ID NO: 1 ,* 

(c) a nucleic acid morecule being degenerate with as a result of 

I 

the genetic code to the nucleic acid sequences as defined in 
(a) or (b), 

(d) a nucleic acid molecule that hybridizes at 50<»C in a solution 
containing 1 x SSC and 0.1 % SDS to a nucleic acid molecule 
as defined in claim 2 or as defined in (a) to (c) and/or a 
nucleic acid molecule which is complementary thereto; 
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(e) a nucleic acid molecule that encodes a polypeptide which is 
at least 85%, preferably at least 90%, more preferably at 
least 95%, more preferably at least 98% and up to 99,6% 
identical to the human DG153, as defined in claim 2 or to a 
polypeptide as defined in (a); 

(f) a nucleic acid molecule that differs from the nucleic acid 
molecule of (a) to (e) by mutation and wherein said mutation 
causes an alteration, deletion, duplication or premature stop 
in the encoded polypeptide 

The composition of any one of claims 1-4, wherein the nucleic acid 
molecule is a DNA molecule, particularly a cDNA or a genomic DNA. 

The composition of any one of claims 1-5, wherein said nucleic acid 
encodes a polypeptide contributing to regulating the metabolism, in 
particular human metabolism. 

The composition of any one of claims 1-6, wherein said nucleic acid 
molecule is a recombinant nucleic acid molecule. 

The composition of any one of claims 1-7, wherein the nucleic acid 
molecule is a vector, particularly an expression vector. 

The composition of any one of claims 1-8, wherein the polypeptide 
is a recombinant polypeptide. 

The composition of claim 9, wherein said recombinant polypeptide is 
a fusion polypeptide. 

The composition of any one of claims 1-10, wherein said nucleic 
acid molecule is selected from hybridization probes, primers and 
anti-sensa oligonucleotides. 
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12. The composition of any one of claims 1-11 which is a diagnostic 
composition. 

13. The composition of any one of claims 1-11 which is a therapeutic 
5 composition. 

14. The composition of any one of claims 1-13 for the manufacture of 
an agent for detecting and/or verifying, for the treatment, alleviation 
and/or prevention pancreatic diseases (e.g. diabetes), obesity, 

,0 metabolic syndrome and/or other metabolic diseases- or 

dysfunctions. 

15. The composition of any one of claims 1-14 for the manufacture of 
an agent for the modulation of pancreatic development. 

15 ' • 

16. The composition of any one of claims 1-15 for the manufacture of 
an agent for the regeneration of pancreatic tissues or cells, 
particularly pancreatic beta cells. 

20 1 7. The composition of any one of claims 1-1 6 for application in vivo. 

1 8. The composition of any one of claims 1-16 for application in vitro. 

19. Use of a DG153 nucleic acid molecule or a polypeptide encoded 
28 thereby or a fragment or a variant of said nucleic acid molecule or 

said polypeptide and/or an effector/modulator of said nucleic or 
polypeptide for the manufacture of a medicament for the treatment 
pancreatic diseases (e.g. diabetes), obesity, metabolic syndrome and 
and/or other metabolic diseases or dysfunction for controlling the 
30 function of a gene and/or a gene product which is influenced and/or 

modified by a DG153 polypeptide. 
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20. Use of a DG153 nucleic acid molecule or use of a polypeptide 
encoded thereby, or use of a fragment or a variant of said nucleic 
acid molecule or said polypeptide, or use of an effector/modulator of 
said nucleic acid molecule or said polypeptide for identifying 
substances capable of interacting with a DG153 polypeptide in vitro 
and/or in vivo. 

21 . A non-human transgenic animal exhibiting a modified expression of 
a DG153 polypeptide. 

22. The animal of claim 21, wherein the expression of the DG153 
polypeptide is increased and/or reduced. 

23. A recombinant host cell exhibiting a modified expression of a DG1 53 
polypeptide, or a recombinant host cell which comprises a nucleic 
acid molecule as defined in any one of claims 1 to 7. 

24. The cell of claim 23 which is a human cell. 

25. A method of identifying a (poly)peptide involved In the regulation of 
energy homeostasis and/or metabolism in a mammal comprising the 
steps of 

(a) contacting a collection of (poly)peptides with a DG153 
homologous polypeptide or a fragment thereof under 
conditions that allow binding of said (poly)peptides; 

(b) removing (poly) peptides which do not bind and 

(c) identifying (poly)peptides that . bind to said DG153 
homologous polypeptide. 



26. A method of screening for an agent which effects/modulates the 
interaction of a DG1 53 polypeptide with a binding targetcomprising 
the ctaps. of 
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(a) incubating a mixture comprising 

(aa) a DG153 polypeptide or a fragment thereof; 

(ab) a binding target/agent of said DG153 polypeptide or 
fragment thereof; and 

(ac) a candidate agent under conditions whereby said 
polypeptide or fragment thereof specifically binds to 
said binding target at a reference affinity; 

(b) detecting the binding affinity of said DG153 polypeptide or 
fragment thereof to said binding target to determine an 
affinity for the agent; and 

(c) determining a difference between affinity for the agent and 
reference affinity. 

27. A method for screening for an agent, which effects/modulates the 
activity of a DG153 polypeptide, comprising the steps of 

(a) incubating a mixture comprising 

(aa) a DG1 53 polypeptide or a fragment thereof; and 

(ab) a candidate agent 

under conditions whereby said DG153 polypeptide or 
fragment thereof exhibits a reference activity, 

(b) detecting the activity of said DG1 53 polypeptide or fragment 
thereof to determine a activity for the agent; and 

(c) determining a difference between activity for the agent and 
reference activity. 

28. A method of producing a composition comprising the (poly)peptide 
identified by the method of claim 25 or the agent identified by the 
method of claim 26 or 27 with a pharmaceutically acceptable carrier 
and/or diluent. 

29. The method of claim 28 wherein said composition is a 
pharmaceutical composition for preventing, alleviating or treating of 
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diseases and disorders, including pancreatic diseases (e.g. diabetes), 
obesity, and/or metabolic syndrome. 

30. Use of a (poly)peptide as identified by the method of claim 25 or of 
5 an agent as identified by the method of claim 26 or 27 for the 

preparation of a pharmaceutical composition (i) for the treatment, 
alleviation and/or prevention of pancreatic diseases (e.g. diabetes), 
obesity, and/or metabolic syndrome, (ii) for the modulation of 
pancreatic development and/or (iii) for the regeneration of pancreatic 
10 cells or tissues. 

31 . Use of a nucleic acid molecule as defined in any one of claims 1 to 
7 or 1 1 for the preparation of a medicament (i) for the treatment, 
alleviation and/or prevention of diseases or dysfunctions, including 

15 pancreatic diseases (e.g. diabetes), obesity, and/or metabolic 

syndrome, (ii) for the modulation of pancreatic development and/or 
(iii) for the regeneration of pancreatic cells or tissues. 

32. Use of a polypeptide as defined in any one of claims 1 to 6, 8 or 9 
20 for the preparation of a medicament (i) for the treatment, alleviation 

and/or prevention of pancreatic diseases (e.g. diabetes), obesity, 
and/or metabolic syndrome, (ii) for the modulation of pancreatic 
development and/or (iii) for the regeneration of pancreatic cells or 
tissues. 

25 

33. Use of a vector as defined in claim 7 for the preparation of a 
medicament (i) for the treatment, alleviation and/or prevention of 
pancreatic diseases (e.g. diabetes), obesity, and/or metabolic 
syndrome, (ii) for the modulation of pancreatic development and/or 

30 (iii) for the regeneration of pancreatic cells or tissues.. 
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34. Use of a host cell as defined In claim 22 or 23 for the preparation of 
a medicament (i) for the treatment, alleviation and/or prevention of 
pancreatic diseases (e.g. diabetes), obesity, and/or metabolic 
syndrome, (ii) for the modulation of pancreatic development and/or 

5 (iii) for the regeneration of pancreatic cells or tissues. 

35. Use of a DG1 53 nucleic acid molecule or of a fragment thereof for 
the production of a non-human transgenic animal which over- or 
under-expresses the DG153 gene product. 

10 

36. Kit comprising at least one of 

(a) a DG1 53 nucleic acid molecule or a functional fragment or an 
isoform thereof; 

(b) a DG1 53 amino acid molecule or a functional fragment or an 
15 isoform thereof; 

(c) a vector comprising the nucleic acid of (a); 

(d) a host cell comprising the nucleic acid of (a) or the vector of 
(b); 

(e) a polypeptide encoded by the nucleic acid of (a), expressed 
20 by the vector of (c) or the host cell of (a); 

(f) a fusion polypeptide encoded by the nucleic acid of (a); 

(g) an antibody, an aptamer or another effector/modulator against 
the nucleic acid of (a) or the polypeptide of (b) , (e) , or (f) 
and /or 

25 (h) an anti-sense oligonucleotide of the nucleic acid of (a). 
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Abstract 



The present invention discloses proteins secreted by the developing 
pancreas, and polynucleotides, which Identify and encode these proteins. 
The invention also relates to the use of these sequences in the diagnosis, 
study, prevention, and treatment of pancreatic diseases (e.g. diabetes), 
obesity, and/or metabolic syndrome. 
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Figure lA. Nucleic acid sequence for human DG153 
(SEQ ID NO:l; GenBank Accession Number M83751) 

1 cttcggtcct gctgtagtgc cttctgcgcc aggcccggtt caatcagcgg ccacaac^gt 
€X ctagggctca gacaccacca gccaatgagg gagggcacgt ggagccgcgt ctgggctcgc 
121 ggctcctgac caatggggaa gtggcatgtg ggagggcgcc ggggttcccc ccgccaatgg 
181 ggagctacgg cgcgcggccg ggacttggag gcggtgcggc gcggcgggtg cggttcagtc 
241 ggtcggcggc ggcagcggag gaggaggagg aggaggagga tgaggaggat gaggaggatg 
301 tgggccacgc aggggctggc ggtgcgcgtg gctctgagcg tgctgccggg cagccgggcg 
361 ctgcggccgg gcgactgcga agtttgtatt tcttatctgg gaagatttta ccaggacctc 
421 aaagacagag atgtcacatt ctcaccagcc actattgaaa acgaacttat aaagttctgc 
481 cgggaagcaa gaggcaaaga gaatcggttg tgctactata tcggggccac agatgatgca 
541 gccaccaaaa tcatcaatga ggtatcaaag cctctggccc accacatccc tgtggagaag 
601 atctgtgaga agcttaagaa gaaggacagc cagatatgtg agcttaagta tgacaagcag 
661 atcgacctga gcacagtgga cctgaagaag ctccgagtta aagagctgaa gaagattctg 
721 gatgactggg gggagacatg caaaggctgt gcagaaaagt ctgactacat ccggaagata 
781 aatgaactga tgcctaaata tgcccccaag gcagccagtg caccgaccga tttgtagtct 
841 gctcaatctc tgttgcacct gagggggaaa aaacagttca actgcttact cccaaaacag 
901 cctttttgta atttattttt taagtgggct cctgacaata ctgtatcaga tgtgaagcct 
961 ggagctttcc tgatgatgct ggccctacag tacccccatg aggggattcc cttccttctg 

1021 ttgctggtgt actctaggac ttcaaagtgt gtctgggatt tttttattaa agaaaaaaaa 

1081 tttctagctg tcaaaaaaaa aaa 



Figure IB. Amino acid sequence for human DG1S3» longer version 
(SEQ ID NO:2; GenBank Accession Number AAB08753.1) 

1 mgkwhvggrr gsprqwgata rgrdleavrr ggcgsvgrrr qrrrrrrrrm rirmrrmwatq 
61 glavrvalsv Ipgsralrpg dcevcisylg rfyqdlkdrd vtfspatien elikfcrear 
121 gkenrlcyyi gatddaatki inevskplah hipvekicek Ikkkdsqice Ikydkqidls 
181 tvdlkklrvk elkkilddwg etckgcaek-s dyirkinelra pkyapkaasa ptdl 



Figure IC. Amino acid sequence for human DG153, shorter version 
(SEQ n> NO:3; SwissPro Accession Number P55145) 

1 mwatqglava lalsvlpgsr alipgdcevc isylgrfyqd Ikdrdvtfsp atienelikf 

61 creargkenr Icyyigatdd aatkiinevs kplahhipve kiceklkkkd sqicelkydk 
121 qidlstvdUc klrvkelkki Iddwgetckg caeksdyirk inelmpkyap kaasartdl 
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